Machine Translation Approaches and Survey for Indian Languages

نویسنده

  • P. J. Antony
چکیده

The term Machine Translation is a standard name for computerized systems responsible for the production of translations from one natural language into another with or without human assistance. It is a sub-field of computational linguistics that investigates the use of computer software to translate text or speech from one natural language to another. Many attempts are being made all over the world to develop machine translation systems for various languages using rule-based as well as statistically based approaches. Development of a full-fledged bilingual machine translation (MT) system for any two natural languages with limited electronic resources and tools is a challenging and demanding task. In order to achieve reasonable translation quality in open source tasks, corpus based machine translation approaches require large amounts of parallel corpora that are not always available, especially for less resourced language pairs. On the other hand, the rule-based machine translation process is extremely time consuming, difficult, and fails to analyze accurately a large corpus of unrestricted text. Even though there has been effort towards building English to Indian language and Indian language to Indian language translation system, unfortunately, we do not have an efficient translation system as of today. The literature shows that there have been many attempts in MT for English to Indian languages and Indian languages to Indian languages. At present, a number of government and private sector projects are working towards developing a full-fledged MT for Indian languages. This paper gives a brief description of the various approaches and major machine translation developments in India.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Survey on Machine Transliteration and Machine Learning Models

Globalization and growth of Internet users truly demands for almost all internet based applications to support local languages. Support of local languages can be given in all internet based applications by means of Machine Transliteration and Machine Translation. This paper provides the thorough survey on machine transliteration models and machine learning approaches used for machine transliter...

متن کامل

An Empirical Survey on Automatic Machine Translation between English and Indian Languages

In this paper, we have reported our survey on systems and projects that intend to translate between English and Indian languages. Most of the translators and projects aim to translate from English to more than one Indian languages. The main challenge is due to the fact that Indian languages are quite different from European languages. In this paper, we have explored the following the following ...

متن کامل

Survey: Natural Language Parsing For Indian Languages

Syntactic parsing is a necessary task which is required for NLP applications including machine translation. It is a challenging task to develop a qualitative parser for morphological rich and agglutinative languages. Syntactic analysis is used to understand the grammatical structure of a natural language sentence. It outputs all the grammatical information of each word and its constituent. Also...

متن کامل

Approaches to Named Entity Recognition in Indian Languages: A Study

Named Entity Recognition (NER) is subtask of information extraction that seeks to locate and classify the elements in some text into pre-defined categories. NER finds its application in Natural Language Processing tasks like machine translation, question-answering systems and automatic summarization. The approaches to NER are rule based, statistics based or a combination of both. In this paper,...

متن کامل

A Hybrid Approach to Example based Machine Translation for Indian Languages

Corpus based approaches to machine translation namely Example based machine translation and Statistical machine translation have received wide focus in the recent years. Hybrid approaches combining the two further improved the performance. Indian language machine translation has mostly focussed on rule based machine translation. We propose a hybrid approach to Example based machine translation ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IJCLCLP

دوره 18  شماره 

صفحات  -

تاریخ انتشار 2013